Apparatus and method for transceiving scene composition information in multimedia communication system

ABSTRACT

A method for transmitting scene composition information from an apparatus therefor in a multimedia communication system is provided. The method includes generating scene composition information comprising media attributes information and temporal information, and transmitting the scene composition information, wherein the media attributes information and temporal information is separated into different formats.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This application is a U.S. National Stage application under 35 U.S.C.§371 of an International application filed on Feb. 4, 2013 and assignedapplication number PCT/KR2013/000890, which claimed the benefit of aKorean patent application filed on Feb. 2, 2012 in the KoreanIntellectual Property Office and assigned Serial number 10-2012-0010870,the entire disclosure of which is hereby incorporated by reference.

JOINT RESEARCH AGREEMENT

The present disclosure was made by or on behalf of the below listedparties to a joint research agreement. The joint research agreement wasin effect on or before the date the present disclosure was made and thepresent disclosure was made as a result of activities undertaken withinthe scope of the joint research agreement. The parties to the jointresearch agreement are 1) SAMSUNG ELECTRONICS CO., LTD. and 2)UNIVERSITY-INDUSTRY COOPERATION GROUP OF KYUNG HEE UNIVERSITY.

TECHNICAL FIELD

The present disclosure relates to a device and a method fortransmitting/receiving scene composition information in a multimediacommunication system. More particularly, the present disclosure relatesto a device and a method for transmitting/receiving scene compositioninformation created in a form in which media attribute information andspatial-temporal information are separated from each other.

BACKGROUND

With the development of digital technologies, significant advances havebeen made in broadcasting-communication convergence. For example, aconsumption environment for multimedia services such as images, music,and entertainment media has been expanded throughbroadcasting-communication convergence media such as an internetprotocol tele-vision (hereinafter, referred to as ‘IPTV’), a smart TV,and the like. The expansion of the consumption environment for themultimedia services has been made through the progression of amultimedia communication environment from the existing environment. Thatis, progression from an environment in which a service for one media isprovided in one direction to an environment in which a variety ofinformation can be simultaneously consumed due to the development ofterminals and communication technologies. Also, the expansion has beenthrough the generalization of media consumption for a consumer to obtainonly desired information.

Scene composition information is provided together with a multimediaservice. When various multimedia services, including a video service andan audio service, are provided, scene composition information representsspatial information and temporal information. The spatial informationrepresents an area that is occupied by each media included in themultimedia service, and the temporal information represents a time pointwhen each media has to be provided. The scene composition information ispresented in the form of a markup language for explanation of anassociative relationship between the media. The scene compositioninformation is mainly presented using extensible markup language (XML)in order to express the spatial-temporal information for the variousmultimedia services. The scene composition information may also be usedfor multimedia services provided in a hybrid broadcasting environmentutilizing the existing broadcasting network and internet network as wellas the existing broadcasting environment. Accordingly, demands for scenecomposition information suitable for a multimedia environment such asthe hybrid broadcasting environment are emerging.

In order to provide a multimedia service in a broadcasting environment,information required for identifying whether media data provided by thecorresponding multimedia service can be used and for accessing the mediadata, and information required for accurately analyzing and displayingreceived media data are needed in an initial stage when the multimediaservice is provided. Here, the information required for identifyingwhether the media data can be used and for accessing the media data, andthe information required for accurately analyzing and displaying thereceived media data correspond to media attribute information.

However, scene composition information provided in the existingmultimedia service is implemented in the form in which media attributeinformation is presented along with spatial-temporal information.Therefore, when the existing scene composition information is used toprovide a multimedia service in a broadcasting environment, a process ofidentifying whether media data can be used and preparing for displayingof the media data may be performed after the scene compositioninformation is completely interpreted. Furthermore, when identical mediadata is repetitively used for composing the corresponding scene, mediaattribute information is also repetitively generated in spite of thesubstantially identical media data.

Moreover, there exists one XML file per scene in the existing scenecomposition information. Therefore, when the entire scene is changed, anew XML file is used. Accordingly, when media data in the previous sceneis continued in the next scene, continuity of the corresponding mediadata cannot be ensured. Consequently, it may be a problem to apply, inthe same way, the existing scene composition information to abroadcasting environment in which media data has to be continuouslyreproduced.

In addition, as a scene is composed based on media data in the existingscene composition information, when some parts of the scene areindependently composed, a complicated presentation is accordinglyrequired. Also, when some parts of the scene are changed, a new XML fileis required. However, such a new XML file cannot ensure the continuityof the media data as described above.

The above information is presented as background information only toassist with an understanding of the present disclosure. No determinationhas been made, and no assertion is made, as to whether any of the abovemight be applicable as prior art with regard to the present disclosure.

SUMMARY

Aspects of the present disclosure are to address at least theabove-mentioned problems and/or disadvantages and to provide at leastthe advantages described below. Accordingly, an aspect of the presentdisclosure is to provide a device and a method fortransmitting/receiving scene composition information in a multimediacommunication system.

Another aspect of the present disclosure is to provide a device and amethod for transmitting/receiving scene composition information, createdin a form in which multimedia attribute information and spatial-temporalinformation are separated from each other, in a multimedia communicationsystem.

In accordance with an aspect of the present disclosure, a method oftransmitting scene composition information by a scene compositioninformation transmitting device in a multimedia communication system isprovided. The method includes creating scene composition informationincluding media attribute information and spatial-temporal information,and transmitting the scene composition information, wherein the mediaattribute information and the spatial-temporal information are separatedinto separate formats.

In accordance with another aspect of the present disclosure, a method ofreceiving scene composition information by a scene compositioninformation receiving device in a multimedia communication system isprovided. The method includes receiving scene composition informationincluding media attribute information and spatial-temporal information,and displaying a scene using the scene composition information, whereinthe media attribute information and the spatial-temporal information areseparated into separate formats.

In accordance with another aspect of the present disclosure, a devicefor transmitting scene composition information in a multimediacommunication system is provided. The device includes a control unitconfigured to create scene composition information including mediaattribute information and spatial-temporal information, and atransmitting unit configured to transmit the scene compositioninformation, wherein the media attribute information and thespatial-temporal information are separated into separate formats.

In accordance with another aspect of the present disclosure, a devicefor receiving scene composition information in a multimediacommunication system is provided. The device includes a receiving unitconfigured to receive scene composition information including mediaattribute information and spatial-temporal information, and a controlunit configured to control to display a scene using the scenecomposition information, wherein the media attribute information and thespatial-temporal information are separated into separate formats.

Embodiments of the present disclosure have an effect of making itpossible to transmit/receive scene composition information in amultimedia communication system. Furthermore, embodiments of the presentdisclosure have an effect of making it possible to transmit/receivescene composition information, created in a form in which mediaattribute information and spatial-temporal information are separatedfrom each other, in a multimedia communication system. Moreover,embodiments of the present disclosure have an effect of making itpossible to transmit/receive scene composition information includingmultimedia initialization information in a multimedia communicationsystem.

A method of providing scene composition information with initializationinformation for a multimedia service according to embodiments of thepresent disclosure may be usefully utilized for consuming the scenecomposition information in a broadcasting environment. Furthermore, amethod of independently composing and updating a part of a sceneaccording to embodiments of the present disclosure may be utilized invarious services such as a multi-view or an N-screen on account ofindependently using the part of the scene. In addition, in making itpossible to maintain continuity of media even when all or a part of ascene is changed, a scene-composition-information transmitting/receivingmethod according to embodiments of the present disclosure has an effectof suggesting a direction for development of other multimedia servicetechnologies requiring a scene composing method.

Other aspects, advantages, and salient features of the disclosure willbecome apparent to those skilled in the art from the following detaileddescription, which, taken in conjunction with the annexed drawings,discloses various embodiments of the present disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects, features, and advantages of certainembodiments of the present disclosure will be more apparent from thefollowing description taken in conjunction with the accompanyingdrawings, in which:

FIG. 1 is a view schematically illustrating a process of presenting anasset using composition information (CI) in an moving picture expertsgroup (MPEG) media transport (MMT) system according to an embodiment ofthe present disclosure;

FIG. 2 is a view schematically illustrating a process of displaying ascene in an MMT system according to an embodiment of the presentdisclosure;

FIG. 3 is a view schematically illustrating a process of presenting ascene, such as the scene of FIG. 2, using CI according to an embodimentof the present disclosure;

FIG. 4 is a view schematically illustrating a structure of spatial andtemporal information of asset (STIA) in an MMT system according to anembodiment of the present disclosure;

FIG. 5 is a view schematically illustrating a structure of STIA andscenes in a time domain in an MMT system according to an embodiment ofthe present disclosure;

FIG. 6 is a view schematically illustrating a structure of a scene andareas in a time domain in an MMT system according to an embodiment ofthe present disclosure;

FIG. 7 is a view schematically illustrating a CI structure in which anSTIA structure, such as the STIA structure as described in FIGS. 4 to 6,is considered according to an embodiment of the present disclosure;

FIG. 8 is a view schematically illustrating the CI structure of FIG. 7in a time domain according to an embodiment of the present disclosure;

FIG. 9 is a view schematically illustrating a CI structure in whichmedia attribute information of an asset may be separated into separateformats using a list of asset (LoA), continuity of an asset may bemaintained even when a scene or an area is changed, and a part of ascene may be independently composed and managed as an area in an MMTsystem according to an embodiment of the present disclosure;

FIG. 10 is a block diagram schematically illustrating an internalstructure of a CI transmitting device in an MMT system according to anembodiment of the present disclosure; and

FIG. 11 is a block diagram schematically illustrating an internalstructure of a CI receiving device in an MMT system according to anembodiment of the present disclosure.

Throughout the drawings, it should be noted that like reference numbersare used to depict the same or similar elements, features, andstructures.

DETAILED DESCRIPTION

The following description with reference to the accompanying drawings isprovided to assist in a comprehensive understanding of variousembodiments of the present disclosure as defined by the claims and theirequivalents. It includes various specific details to assist in thatunderstanding but these are to be regarded as merely exemplary.Accordingly, those of ordinary skill in the art will recognize thatvarious changes and modifications of the various embodiments describedherein can be made without departing from the scope and spirit of thepresent disclosure. In addition, descriptions of well-known functionsand constructions may be omitted for clarity and conciseness.

The terms and words used in the following description and claims are notlimited to the bibliographical meanings, but, are merely used by theinventor to enable a clear and consistent understanding of the presentdisclosure. Accordingly, it should be apparent to those skilled in theart that the following description of various embodiments of the presentdisclosure is provided for illustration purpose only and not for thepurpose of limiting the present disclosure as defined by the appendedclaims and their equivalents.

It is to be understood that the singular forms “a,” “an,” and “the”include plural referents unless the context clearly dictates otherwise.Thus, for example, reference to “a component surface” includes referenceto one or more of such surfaces.

Embodiments of the present disclosure provide a device and a method fortransmitting/receiving scene composition information in a multimediacommunication system.

Furthermore, embodiments of the present disclosure provide a device anda method for transmitting/receiving scene composition information,created in a form in which media attribute information andspatial-temporal information are separated from each other, in amultimedia communication system.

Hereinafter, in describing embodiments of the present disclosure, themultimedia communication system is assumed to be, for example, a movingpicture experts group (MPEG) media transport (MMT) system, and it isapparent that the multimedia communication system may be an evolvedpacket system (EPS), a long-term evolution (LTE) mobile communicationsystem, an institute of electrical and electronics engineers (IEEE)802.16m communication system, or the like, as well as the MMT system.

In the present disclosure, a device and a method will be described fortransmitting and receiving scene composition information, which ispresented using, for example, extensible markup language (XML), in amultimedia communication system. However, the device and the method fortransmitting and receiving scene composition information, proposed byembodiments of the present disclosure, may be applied as they are evenwhen scene composition information presented using other formats as wellas XML is used.

Scene composition information newly proposed by embodiments of thepresent disclosure is defined as composition information (CI). Indisplaying various pieces of media data on one terminal, the CIrepresents information expressing a time and a space on a screen onwhich the media data is displayed and information explaining anassociative relationship between the media data displayed using theinformation expressing the time and the space on the screen on which themedia data is displayed.

Media data to which the CI may be applied, such as video data, audiodata, image data, text data, and the like, is defined as an asset. Alist of assets used to compose one scene in a multimedia service isdefined as a list of asset (LoA).

Information required for accessing an asset and information required foraccurately analyzing a received asset and displaying it are defined asasset information (AI). Here, the information required for accessing theasset represents an address of a media source, and may be, for example,a uniform resource identifier (URI). The information required foraccurately analyzing the received asset and displaying it may include atype, a format, and the like of media, and, for example, when media datais video data, may include a profile, a level, and the likecorresponding to a media format.

A set of spatial information and temporal information for each assetincluded in CI is defined as spatial and temporal information of asset(STIA).

An entire area in which media data is displayed on a screen of aterminal is defined as a scene, and one scene includes one or moreareas. Here, each area may be a partial area of the scene. Spatialinformation for a scene, an area, and an asset is defined as spatialinformation (SI), and temporal information for the scene, the area, andthe asset is defined as temporal information (TI).

Furthermore, in using CI in a multimedia service, embodiments of thepresent disclosure propose a method of separating AI from STIA andproviding it, a method of dividing a scene into areas and composing andmanaging the areas independently of the scene, and a method ofmaintaining continuity of an asset even when a scene or an area ischanged.

A process of presenting an asset using CI in an MMT system according toan embodiment of the present disclosure will be described with referenceto FIG. 1.

FIG. 1 is a view schematically illustrating a process of presenting anasset using CI in an MMT system according to an embodiment of thepresent disclosure.

Referring to FIG. 1, the CI includes media attribute information andtemporal information. The media attribute information is presentedthrough AI of each asset included in LoA, and the temporal informationand spatial information is presented through STIA of each asset includedin the LoA.

When the CI is presented as described above with reference to FIG. 1,media attribute information of an asset required for initialization maybe simply acquired by analyzing only the LoA but not the entire CI.Here, as the CI is divided into the LoA and the STIA in an embodiment ofthe present disclosure, a connection relation between the LoA and theSTIA is defined as asset connection information, and the assetconnection information is referred to as ref Asset.

Furthermore, when the CI is presented as described above with referenceto FIG. 1, if one asset is repetitively displayed in the same sceneseveral times, the CI may additionally include only connectioninformation ref Asset in the CI to reduce the amount of data requiredfor presenting the CI, without having to repetitively present mediaattribute information for the corresponding asset.

With reference to FIG. 1, the process of presenting the asset using theCI in the MMT system according to an embodiment of the presentdisclosure has been described above, and a process of displaying a scenein an MMT system according to an embodiment of the present disclosurewill next be described with reference to FIG. 2.

FIG. 2 is a view schematically illustrating a process of displaying ascene in an MMT system according to an embodiment of the presentdisclosure.

Referring to FIG. 2, the scene includes a plurality of areas, forexample, Area 1, Area 2, and Area 3, and each of the plurality of areasincludes an asset. Area 1 includes three assets, namely, a video asset,an audio asset, and a caption asset, Area 2 includes an image asset, andArea 3 includes a text asset.

Accordingly, as illustrated in FIG. 2, one scene is divided into theplurality of areas, and the assets included in each of the plurality ofareas are subordinated to the corresponding area, so that the areas maybe composed and changed independently of the scene. Consequently, whenthe scene is composed as illustrated in FIG. 2, each area may beindependently used.

Referring to FIG. 2, the process of displaying the scene in the MMTsystem according to an embodiment of the present disclosure has beendescribed above, and a process of presenting a scene using CI will nextbe described with reference to FIG. 3.

FIG. 3 is a view schematically illustrating a process of presenting ascene, such as the scene of FIG. 2, using CI according to an embodimentof the present disclosure.

Referring to FIG. 3, the areas which the scene includes are presented,and the assets included in the corresponding area are presented withinthe area.

Which area each of the areas represents within the corresponding sceneshould be presented, and therefore, SI of the corresponding area isincluded in the CI. Furthermore, the CI includes the SI of the scene inorder to provide a criterion required for presenting the SI of each ofthe areas.

Furthermore, in order to reduce duplicate presentation of the SI, SI ofthe asset identical to the SI of the corresponding area in which theasset is included may be omitted.

As described above, the CI structure illustrated in FIG. 3 makes itpossible to intuitively identify the structure of the scene, the areas,and the assets displayed on a screen, thereby achieving easy creationand edition of the CI.

With reference to FIG. 3, the process of presenting the scene of FIG. 2using the CI has been described above, and a structure of STIA in an MMTsystem according to an embodiment of the present disclosure will next bedescribed with reference to FIG. 4.

FIG. 4 is a view schematically illustrating a structure of STIA in anMMT system according to an embodiment of the present disclosure.

Referring to FIG. 4, there are a plurality of scenes illustrated in theSTIA structure. A plurality of areas are included in each of theplurality of scenes, and a plurality of assets are included in each ofthe plurality of areas. In particular, the STIA structure illustrated inFIG. 4, when there are a plurality of convertible scenes includedtherein, does not need separate additional CI for conversion of theplurality of scenes, and therefore, even though a specific scene of theplurality of scenes is converted, playback related information of anasset may be maintained.

Likewise to this, in the STIA structure illustrated in FIG. 4, whenthere are a plurality of convertible areas in one scene, even though aspecific area of the plurality of areas is converted, playback relatedinformation of an asset may be maintained.

With reference to FIG. 4, the STIA structure in the MMT system accordingto an embodiment of the present disclosure has been described above, anda structure of STIA and scenes in a time domain in an MMT systemaccording to an embodiment of the present disclosure will next bedescribed with reference to FIG. 5.

FIG. 5 is a view schematically illustrating a structure of STIA andscenes in a time domain in an MMT system according to an embodiment ofthe present disclosure.

Referring to FIG. 5, in the time domain, the STIA represents that thescenes are activated in series based on a time axis. Although playbackrelated information of an asset may be maintained irrespective of aconversion of the scenes as described in FIG. 4, a time axis on whichall of the scenes may be considered is additionally required to maintaincontinuity of the asset. Accordingly, as illustrated in FIG. 5, the STIAis implemented in view of the time axis.

With reference to FIG. 5, the structure of the STIA and the scenes inthe time domain in the MMT system according to an embodiment of thepresent disclosure has been described above, and a structure of a sceneand areas in a time domain in an MMT system according to an embodimentof the present disclosure will next be described with reference to FIG.6.

FIG. 6 is a view schematically illustrating a structure of a scene andareas in a time domain in an MMT system according to an embodiment ofthe present disclosure.

Referring to FIG. 6, in the time domain, the STIA represents that theareas are activated in parallel based on a time axis. Although playbackrelated information of an asset may be maintained irrespective of aconversion of the areas as described in FIG. 4, a time axis on which allof the areas may be considered is additionally required to maintaincontinuity of the asset. Accordingly, as illustrated in FIG. 6, the STIAis implemented in view of the time axis, and thus, the scenes are alsoimplemented in view of the time axis.

With reference to FIG. 6, the structure of the scene and the areas inthe time domain in the MMT system according to an embodiment of thepresent disclosure has been described above, and with reference to FIG.7, a CI structure will next be described in which such an STIA structureas described in FIGS. 4 to 6 is considered in an MMT system according toan embodiment of the present disclosure.

FIG. 7 is a view schematically illustrating a CI structure in which anSTIA structure, such as the STIA structure as described in FIGS. 4 to 6,is considered according to an embodiment of the present disclosure.

Referring to FIG. 7, a CI structure in which an STIA structure asdescribed in FIGS. 4 to 6 that reflects the structure of the scenes, theareas, and the assets is considered. First, scenes in the STIA have tobe activated in series. Accordingly, each scene includes TI thereof torepresent an activation time point and a deactivation time point, andwhen a previously described scene is deactivated, a next scene isactivated.

Furthermore, areas included in one scene have to be activated inparallel. Accordingly, each area includes TI thereof to represent anactivation time point and a deactivation time point. Here, when SI andTI existing within each of the scenes, the areas, and the assets aredefined, an external reference may also exist to provide flexibility ofpresentation for the SI and the TI. In this way, in order to reduceduplicate presentation, when an asset has the same activation time asthat of an area including itself, TI of the area may be omitted, andwhen an area has the same activation time as that of a scene includingitself, TI of the area may be omitted. With reference to FIG. 7, the CIstructure in which such an STIA structure as described in FIGS. 4 to 6is considered has been described above, and the CI structure of FIG. 7in a time domain will next be described with reference to FIG. 8.

FIG. 8 is a view schematically illustrating the CI structure of FIG. 7in a time domain according to an embodiment of the present disclosure.

Referring to FIG. 8, although Scene 1 is changed to Scene 2 in 10seconds after activation of STIA, Asset 1 and Asset 2 identically appearin Area 1 and Area 4. In this case, playback time information, whichAsset 1 and Asset 2 included in Scene 1 have, is used for Asset 1 andAsset 2 included in Scene 2 as it is, so that continuity of the assetsmay be maintained.

With reference to FIG. 8, the CI structure of FIG. 7 in the time domainhas been described above, and with reference to FIG. 9, a CI structurewill next be described in which media attribute information of an assetmay be separated into separate formats using an LoA, continuity of anasset may be maintained even when a scene or an area is changed, and apart of a scene may be independently composed and managed as an area inan MMT system according to an embodiment of the present disclosure.

FIG. 9 is a view schematically illustrating a CI structure in whichmedia attribute information of an asset may be separated into separateformats using an LoA, continuity of an asset may be maintained even whena scene or an area is changed, and a part of a scene may beindependently composed and managed as an area in an MMT system accordingto an embodiment of the present disclosure.

Referring to FIG. 9, media attribute information of an asset isseparated into separate formats using an LoA. Furthermore, STIA isimplemented to be separate from the LoA such that continuity of an assetmay be maintained even when a scene or an area is changed, and a part ofa scene may be independently composed and managed as an area.

With reference to FIG. 9, the CI structure has been described above inwhich the media attribute information of the asset may be separated intothe separate formats by using the LoA, the continuity of the asset maybe maintained even when the scene or the area is changed, and the partof the scene may be independently composed and managed as the area inthe MMT system according to an embodiment of the present disclosure, andan internal structure of a CI transmitting device in an MMT systemaccording to an embodiment of the present disclosure will next bedescribed with reference to FIG. 10.

FIG. 10 is a block diagram schematically illustrating an internalstructure of a CI transmitting device in an MMT system according to anembodiment of the present disclosure.

Referring to FIG. 10, a CI transmitting device 1000 includes a receivingunit 1011, a control unit 1013, a transmitting unit 1015, and a storageunit 1017.

The control unit 1013 controls overall operations of the CI transmittingdevice. The control unit 1013 controls to perform overall operationsrelated to an operation of transmitting CI for implementing,particularly, an LoA and STIA according to an embodiment of the presentdisclosure as separate formats. Here, the overall operations related tothe operation of transmitting the CI are the same as those describedwith reference to FIGS. 1 to 9, and therefore, detailed descriptionsthereof will be omitted.

The receiving unit 1011 receives various types of signals from a CIreceiving device, etc. under the control of the control unit 1013. Here,the various types of signals received by the receiving unit 1011 are thesame as those described with reference to FIGS. 1 to 9, and therefore,detailed descriptions thereof will be omitted.

The transmitting unit 1015 transmits various types of signals to the CIreceiving device, etc. under the control of the control unit 1013. Here,the various types of signals transmitted by the transmitting unit 1015are the same as those described with reference to FIGS. 1 to 9, andtherefore, detailed descriptions thereof will be omitted.

The storage unit 1017 stores the various types of signals received bythe receiving unit 1011, and various types of data required for anoperation of the CI transmitting device, particularly, informationrelated to the operation of transmitting the CI.

Meanwhile, although the receiving unit 1011, the control unit 1013, thetransmitting unit 1015, the storage unit 1017, and the output unit 1019are implemented as separate units in FIG. 10, it is apparent that thereceiving unit 1011, the control unit 1013, the transmitting unit 1015,the storage unit 1017, and the output unit 1019 can be implemented asone integrated unit.

With reference to FIG. 10, the internal structure of the CI transmittingdevice in the MMT system according to an embodiment of the presentdisclosure has been described above, and an internal structure of a CIreceiving device in an MMT system according to an embodiment of thepresent disclosure will next be described with reference to FIG. 11.

FIG. 11 is a block diagram schematically illustrating an internalstructure of a CI receiving device in an MMT system according to anembodiment of the present disclosure.

Referring to FIG. 11, a CI receiving device 1100 includes a receivingunit 1111, a control unit 1113, a transmitting unit 1115, and a storageunit 1117.

The control unit 1113 controls overall operations of the CI receivingdevice. The control unit 1013 controls to perform overall operationsrelated to an operation of receiving CI for implementing, particularly,an LoA and STIA according to an embodiment of the present disclosure asseparate formats. Here, the overall operations related to the operationof receiving the CI are the same as those described with reference toFIGS. 1 to 9, and therefore, detailed descriptions thereof will beomitted.

The receiving unit 1111 receives various types of signals from a CItransmitting device under the control of the control unit 1113. Here,the various types of signals received by the receiving unit 1011 are thesame as those described with reference to FIGS. 1 to 9, and therefore,detailed descriptions thereof will be omitted.

The transmitting unit 1115 transmits various types of signals to the CItransmitting device under the control of the control unit 1113. Here,the various types of signals transmitted by the transmitting unit 1115are the same as those described with reference to FIGS. 1 to 9, andtherefore, detailed descriptions thereof will be omitted.

The storage unit 1117 stores the various types of signals received bythe receiving unit 1111, information related to operations of the CIreceiving device, and the like.

Meanwhile, although the receiving unit 1111, the control unit 1113, thetransmitting unit 1115, and the storage unit 1117 are implemented asseparate units in FIG. 11, it is apparent that the receiving unit 1111,the control unit 1113, the transmitting unit 1115, and the storage unit1117 can be implemented as one integrated unit.

It will be appreciated that various embodiments of the presentdisclosure according to the claims and description in the specificationcan be realized in the form of hardware, software or a combination ofhardware and software.

Any such software may be stored in a non-transitory computer readablestorage medium. The non-transitory computer readable storage mediumstores one or more programs (software modules), the one or more programscomprising instructions, which when executed by one or more processorsin an electronic device, cause the electronic device to perform a methodof the present disclosure.

Any such software may be stored in the form of volatile or non-volatilestorage such as, for example, a storage device like a Read Only Memory(ROM), whether erasable or rewritable or not, or in the form of memorysuch as, for example, Random Access Memory (RAM), memory chips, deviceor integrated circuits or on an optically or magnetically readablemedium such as, for example, a Compact Disk (CD), Digital Versatile Disc(DVD), magnetic disk or magnetic tape or the like. It will beappreciated that the storage devices and storage media are variousembodiments of non-transitory machine-readable storage that are suitablefor storing a program or programs comprising instructions that, whenexecuted, implement various embodiments of the present disclosure.Accordingly, various embodiments provide a program comprising code forimplementing apparatus or a method as claimed in any one of the claimsof this specification and a non-transitory machine-readable storagestoring such a program.

While the present disclosure has been shown and described with referenceto various embodiments thereof, it will be understood by those skilledin the art that various changes in form and details may be made thereinwithout departing from the spirit and scope of the present disclosure asdefined by the appended claims and their equivalents.

The invention claimed is:
 1. A method of transmitting scene compositioninformation in a multimedia system, the method comprising: generatingscene composition information comprising a list of asset (LoA) whichincludes media data type information for a plurality of scenes andspatial and temporal information of asset (STIA) which includes spatialand temporal information related to each of a plurality of scenes andspatial and temporal information related to each of a plurality of areascomposing a scene of the plurality of scenes; and transmitting the scenecomposition information, wherein the LoA indicates asset information ofall assets composing the plurality of scenes, and wherein the spatialand temporal information related to the plurality of scenes representsthat the plurality of scenes is activated in series based on a timeaxis, and the plurality of areas included in the scene is activated inparallel based on the time axis.
 2. The method of claim 1, wherein typesof an asset included in each of the plurality of areas are differentfrom each other.
 3. The method of claim 1, wherein media data typeinformation for initialization is acquired by analyzing the LoA.
 4. Themethod of claim 1, wherein the media data type information comprises anaddress of a media source, and wherein the spatial and temporalinformation related to each of the plurality of areas comprises at leastone of a type or a format of media.
 5. A non-transitorycomputer-readable storage medium storing instructions that, whenexecuted, cause at least one processor to perform the method of claim 1.6. The method of claim 1, wherein the LoA is separately described fromthe STIA in the scene composition information.
 7. A method of receivingscene composition information in a multimedia system, the methodcomprising: receiving scene composition information comprising a list ofasset (LoA) which includes media data type information for a pluralityof scenes and spatial and temporal information of asset (STIA) whichincludes spatial and temporal information related to each of a pluralityof scenes and spatial and temporal information related to each of aplurality of areas composing a scene of the plurality of scenes; anddisplaying a scene using the scene composition information, wherein theLoA indicates asset information of all assets composing the plurality ofscenes, and wherein the spatial and temporal information related to theplurality of scenes represents that the plurality of scenes is activatedin series based on a time axis, and the plurality of areas included inthe scene is activated in parallel based on the time axis.
 8. The methodof claim 7, wherein types of an asset included in each of the pluralityof areas are different from each other.
 9. The method of claim 7,wherein media data type information for initialization is acquired byanalyzing the LoA.
 10. The method of claim 7, wherein the media datatype information comprises an address of a media source, and wherein thespatial and temporal information related to each of the plurality ofareas comprises at least one of a type or a format of media.
 11. Adevice for transmitting scene composition information in a multimediasystem, the device comprising: at least one processor configured togenerate scene composition information comprising a list of asset (LoA)which includes media data type information for a plurality of scenes andspatial and temporal information of asset (STIA) which includes spatialand temporal information related to each of a plurality of scenes andspatial and temporal information related to each of a plurality of areascomposing a scene of the plurality of scenes; and a transmitterconfigured to transmit the scene composition information, wherein theLoA indicates asset information of all assets composing the plurality ofscenes, and wherein the spatial and temporal information related to theplurality of scenes represents that the plurality of scenes is activatedin series based on a time axis, and the plurality of areas included inthe scene is activated in parallel based on the time axis.
 12. Thedevice of claim 11, wherein types of an asset included in each of theplurality of areas are different from each other.
 13. The device ofclaim 11, wherein media data type information for initialization isacquired by analyzing the LoA.
 14. The device of claim 11, wherein themedia data type information comprises an address of a media source, andwherein the spatial and temporal information related to each of theplurality of areas comprises at least one of a type or a format ofmedia.
 15. A device for receiving scene composition information in amultimedia communication system, the device comprising: a receiverconfigured to receive scene composition information comprising a list ofasset (LoA) which includes media data type information for a pluralityof scenes and spatial and temporal information of asset (STIA) whichincludes spatial and temporal information related to each of a pluralityof scenes and spatial and temporal information related to each of aplurality of areas composing a scene of the plurality of scenes; and atleast one processor configured to control a display of a scene using thescene composition information, wherein the LoA indicates assetinformation of all assets composing the plurality of scenes, and whereinthe spatial and temporal information related to the plurality of scenesrepresents that the plurality of scenes is activated in series based ona time axis, and the plurality of areas included in the scene isactivated in parallel based on the time axis.
 16. The device of claim15, wherein types of an asset included in the plurality of areas aredifferent from each other.
 17. The device of claim 15, wherein mediadata type information for initialization is acquired by analyzing theLoA.
 18. The device of claim 15, wherein the media data type informationcomprises an address of a media source, and wherein the spatial andtemporal information related to each of the plurality of areas comprisesat least one of a type or a format of media.
 19. The device of claim 15,wherein the LoA is separately described from the STIA in the scenecomposition information.